Internet Info 1997 December

home *** CD-ROM | disk | FTP | other *** search

/ Internet Info 1997 December / Internet_Info_CD-ROM_Walnut_Creek_December_1997.iso / ietf / urn / urn-archives / urn-ietf.archive.9611 / 000016_owner-urn-ietf _Mon Nov 4 05:51:04 1996.msg < prev next >

Wrap

Internet Message Format | 1997-02-19 | 3KB

Received: (from daemon@localhost) by services.bunyip.com (8.6.10/8.6.9) id FAA27406 for urn-ietf-out; Mon, 4 Nov 1996 05:51:04 -0500 Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.6.10/8.6.9) with SMTP id FAA27401 for <urn-ietf@services.bunyip.com>; Mon, 4 Nov 1996 05:51:02 -0500 Received: from josef.ifi.unizh.ch by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA21183 (mail destined for urn-ietf@services.bunyip.com); Mon, 4 Nov 96 05:51:00 -0500 Received: from ifi.unizh.ch by josef.ifi.unizh.ch id <00908-0@josef.ifi.unizh.ch>; Mon, 4 Nov 1996 11:50:37 +0100 Subject: Re: [URN] %encoding for reserved UTF-8 characters (was: New syntax draft) To: jayhawk@ds.internic.net Date: Mon, 4 Nov 1996 11:50:36 +0100 (MET) Cc: urn-ietf@bunyip.com In-Reply-To: <9611011512.AA05839@mocha.bunyip.com> from "Ryan Moats" at Nov 1, 96 09:12:22 am Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Content-Length: 1322 From: Martin J Duerst <mduerst@ifi.unizh.ch> Message-Id: <"josef.ifi..324:04.10.96.10.50.51"@ifi.unizh.ch> Sender: owner-urn-ietf@services.bunyip.com Precedence: bulk Reply-To: Martin J Duerst <mduerst@ifi.unizh.ch> Errors-To: owner-urn-ietf@bunyip.com Ryan Moats wrote: >>Now for UTF-8, things are quite different. 8-bit bytes will >>on many occasions be escaped because it may be difficult to >>represent them otherwise. Having some character beyond ASCII >>represented with %HH (usually %HH%HH or %HH%HH%HH) can in no >>way imply that this is a special character. >>This means that any tools dealing with URNs in general will >>have no clue about where to keep the escaping, and where >>to remove it. A very exact knowledge of each NSS syntax >>would be needed. > >My brain hurts, but I think I finally understand the issue. Sorry for my long explanations. >Allow me to restate it to see if I'm right: > >The problem you are talking about arrises if the reserved character has >a UTF-8 representation of more than 1 octet. Then, if we use %encoding >to represent the character in a literal use, there is no way of determining >from the URN whether the character is being used as a literal character >or not. Exactly. >I'll save a discussion of potential solution directions to this until I'm sure >I understand the issue. I have already mentionned two possibilities: - Specify that protocols may only reserve 1-octet UTF-8 characters (i.e. ASCII). - Specify that protocols have to define their own escaping mechanisms for things beyond ASCII. Regards, Martin.